Basic Statistics

Raw Counts

Name Value
Rows 16,290
Columns 96
Discrete columns 20
Continuous columns 76
All missing columns 0
Missing observations 44,497
Complete Rows 13,855
Total observations 1,563,840
Memory allocation 12.2 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 7 columns ignored with more than 50 categories.
## date: 16204 categories
## raceomb_003sub_white: 111 categories
## occuSelfDetail: 100 categories
## MSANo: 401 categories
## CountyNo: 190 categories
## MSAName: 401 categories
## STATE: 54 categories

QQ Plot

## Warning: Removed 34 rows containing non-finite values
## (`stat_qq()`).
## Warning: Removed 34 rows containing non-finite values
## (`stat_qq_line()`).

## Warning: Removed 474 rows containing non-finite values
## (`stat_qq()`).
## Warning: Removed 474 rows containing non-finite values
## (`stat_qq_line()`).

## Warning: Removed 450 rows containing non-finite values
## (`stat_qq()`).
## Warning: Removed 450 rows containing non-finite values
## (`stat_qq_line()`).

## Warning: Removed 495 rows containing non-finite values
## (`stat_qq()`).
## Warning: Removed 495 rows containing non-finite values
## (`stat_qq_line()`).

## Warning: Removed 495 rows containing non-finite values
## (`stat_qq()`).
## Removed 495 rows containing non-finite values
## (`stat_qq_line()`).

## Warning: Removed 508 rows containing non-finite values
## (`stat_qq()`).
## Warning: Removed 508 rows containing non-finite values
## (`stat_qq_line()`).

## Warning: Removed 275 rows containing non-finite values
## (`stat_qq()`).
## Warning: Removed 275 rows containing non-finite values
## (`stat_qq_line()`).

Correlation Analysis

## 14 features with more than 20 categories ignored!
## date: 13780 categories
## raceombmulti: 36 categories
## raceomb_003sub_asian: 29 categories
## raceomb_003sub_black: 30 categories
## raceomb_003sub_hispanic: 32 categories
## raceomb_003sub_middleeast: 21 categories
## raceomb_003sub_white: 110 categories
## occuSelf: 25 categories
## occupation_self_002: 25 categories
## occuSelfDetail: 100 categories
## MSANo: 398 categories
## CountyNo: 187 categories
## MSAName: 398 categories
## STATE: 54 categories
## Warning in cor(x = structure(list(session_id =
## structure(c(2668817320, 2668818749, : the standard
## deviation is zero

Principal Component Analysis

## 7 features with more than 50 categories ignored!
## date: 13780 categories
## raceomb_003sub_white: 110 categories
## occuSelfDetail: 100 categories
## MSANo: 398 categories
## CountyNo: 187 categories
## MSAName: 398 categories
## STATE: 54 categories
## Warning in plot_prcomp(data = structure(list(session_id = structure(c(2668817320, : The following features are dropped due to zero variance:
##  * year
##  * N_3467
##  * Side_Good_34
##  * N_3
##  * N_4
##  * N_5
##  * N_6
##  * N_7
##  * study_name_Demo.Race.0006